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Abstract 

A metric space X has Markov type 2, if for any reversible finite-state Markov chain {Z t } (with 
Zq chosen according to the stationary distribution) and any map / from the state space to X , 
the distance D t from f(Z ) to f{Z t ) satisfies E(D t 2 ) < K 2 t¥,{D\) for some K = K(X) < oo. 
This notion is due to K.Bali (1992), who showed its importance for the Lipschitz extension 
problem. However until now. only Hilbcrt space (and its bi-Lipschitz equivalents) were known 
to have Markov type 2. We show that every Banach space with modulus of smoothness of power 
type 2 (in particular, L p for p > 2) has Markov type 2; this proves a conjecture of Ball. We 
also show that trees, hyperbolic groups and simply connected Ricmannian manifolds of pinched 
negative curvature have Markov type 2. Our results are applied to settle several conjectures on 
Lipschitz extensions and embeddings. In particular, we answer a question posed by Johnson 
and Lindenstrauss in 1982, by showing that for l<q<2<p< oo, any Lipschitz mapping 
from a subset of L p to L q has a Lipschitz extension defined on all of L p . 

1 Introduction 

K. Ball [2] introduced the notion of Markov type of metric spaces, defined as follows. Recall that a 
Markov chain {Zt}^ with transition probabilities := Pr(Zt+i = j \ Z% = i) on the state space 
{1, . . . , n} is stationary if 7Tj := Pr(Z^ = i) does not depend on t and it is reversible if 7Tj aij = TTj aji 
for every i,j £ {1, . . . , n}. 

Definition 1.1 (Ball [2j). Given a metric space (X, d) andp G [1, oo), we say that X has Markov 
type p if there exists a constant K > such that for every stationary reversible Markov chain 
{Z t }^Q on {1, ... , n}, every mapping f : {1, . . . , n} — > X and every time t G N, 

Ed(f(Z t ),f(Z )y < KPtEd(f(Z 1 ),f(Z )r. 

(Here and throughout, we omit some parentheses and write ¥,x p for K(x p ), etc.) The least such K 
is called the Markov type p constant of X, and is denoted M P {X). 
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Ball introduced this concept in his profound study of the Lipschitz extension problem [2] (see 
Section 2), and the notion of Markov type has since found applications in the theory of bi-Lipschitz 
embeddings |46l @] . The main theorem in [2] states that Lipschitz functions from a subset of a 
metric space X having Markov type 2 into a Banach space with modulus of convexity of power 
type 2 (see the definition in (j3J) below) extend to Lipschitz functions defined on all of X. Ball 
showed that M^iL^) = 1, yet apart from Hilbert space and its bi-Lipschitz equivalents, no other 
metric spaces were known to have Markov type 2. Ball asked in [2] whether L p for 2 < p < oo has 
Markov type 2, and more generally, whether every Banach space with modulus of smoothness of 
power type 2 has Markov type 2. Recall that X has modulus of smoothness of power type 2 
if for all x, y in the unit sphere of X, we have \\x + ry\\ + \\x — ry\\ — 2 < K$t 2 for some constant 
Ks = Kg{X) < oo and all r > (see also © below). Here we answer Ball's question positively, 
and prove: 

Theorem 1.2. Every normed space with modulus of smoothness of power type 2 has Markov type 
2. Moreover, for every 2 < p < oo, we have M2{L p ) < A\Jp — 1. 

In conjunction with Ball's extension theorem [2], this implies the following non-linear version 
of Maurey's extension theorem |49| (see below), answering a question posed by Johnson and Lin- 
denstrauss in [2*§] . 

Theorem 1.3. Let X be a Banach space with modulus of smoothness of power type 2 and let Y be a 
Banach space with modulus of convexity of power type 2. Then there exists a constant C = C(X, Y) 
such that for every subset iCI and every Lipschitz mapping f : A — > Y , it is possible to extend 
ftof:X^Y such that H/llLip < CH/llLip. 

In particular, for X = L p and Y = L q with l<q<2<p< oo, we can take C = 
0(\/(p — l)/(q — 1)) (but even the finiteness of C in this case wasn't known). Additional geo- 
metric applications of Theorem 1 1 . 31 will be discussed later in this introduction. 

Our methods yield Markov type 2 for several new classes of spaces; in particular we have the 
following result, which answers positively a question posed in |56| . 

Theorem 1.4. There exists a universal constant C > such that for every tree T with arbitrary 
positive edge lengths, 

M 2 (T) < C. 

In this theorem, as well as in Theorem 11.51 and Corollary 11.61 below, one can take e.g. C = 30. 
On the other hand, we show in Section |H] that the infinite 3-regular tree satisfies M 2 (T) > \/3- 

In fact, Theorem 1 1.41 is a particular case of the following result which holds for arbitrary Gromov 
hyperbolic spaces. One of many alternative definitions for Gromov- hyperbolic spaces is as follows 
(background material on this topic can be found in the monographs |23 | 18} 124"! I64j ) . Let (X, d) be 
a metric space. For x,y,r E X the Gromov product with respect to r is defined as: 

(x\y) r := d(z,r)+d(y r)-d(z,y)_ (1) 

For 5 > 0, the metric space X is said to be 5-hyperbolic if for every x, y, z,r £ X, 

(x\y) r > mm{(x\z) r , (y\z) r } - 5, (2) 
and X is Gromov-hyperbolic if it is 5- hyperbolic for some 5 < oo. 
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For trees, {x\y) r is precisely the distance of the least common ancestor of x and y from the root 
r. It is easy to conclude (and is well known) that every tree is O-hyperbolic. Thus, the following 
theorem generalizes Theorem 11.41 

Theorem 1.5. There exists a universal constant C > such that for every 5-hyperbolic metric 
space X , every stationary reversible Markov chain {Zt}^2. on {1, . . . , n}, every f : {1, . . . , n} — ► X 
and every time t > 1, 



Hyperbolic groups are finitely generated groups on which the word metric is <5-hyperbolic for 
some < 5 < oo (see EH])- Since the minimal distance in such metric spaces is 1, we have the 
following corollary of Theorem 11.51 

Corollary 1.6. There exists a universal constant C > such that for every 5-hyperbolic group G 
equipped with word metric, 



In Section [3 we also prove the following result: 

Theorem 1.7. Let X be an n dimensional, complete, simply connected Riemannian manifold 
with pinched negative sectional curvature, i.e., its sectional curvature takes values in the interval 
[— R, —r], where < r < R < oo. Then X has Markov type 2 and M%(X) can be bounded by a 
function of n,r, R. 

This paper is organized as follows. In Section [2] we discuss the history and motivation of Ball's 
Markov type 2 problem, as well as the background we require from the geometry of Banach spaces. 
In Section |3] we briefly describe some applications of our results to the extension problem for 
Holder functions and bi-Lipschitz embeddings. Section 0] deals with the behavior of Markov chains 
in smooth normed spaces, and contains the solution of Ball's Markov type 2 problem. Section [5] 
deals with the case of trees and hyperbolic metric spaces. Section Q proves that the Laakso graphs 
have Markov type 2. It is also proved there that doubling spaces and planar graphs have a weak 
form of Markov type 2. Finally, Section |H1 contains some open problems. 

2 Linear and non-linear type and cotype; the linear and Lipschitz 
extension problems 

The classical Jordan-von Neumann theorem [HJ states that Hilbert space is characterized among 
Banach spaces by the parallelogram identity \\x + y|| 2 + \\x — y\\ 2 = 2||x|| 2 + 2||y|| 2 . Induc- 
tively it follows that the generalized parallelogram identity 2~ n Yl ei £{-i +1} + • • • + £n^n|| 2 = 
Y^i=\ \\ x i\\ 2 encodes the rich geometric structure of Hilbert space. In the early 1970's, the work 
of Dubinsky-Pelczyhsky-Rosenthal ^H]; Hoffmann- Jorgensen [22], Kwapien (HE], Maurey [IB] and 
Maurey-Pisier |52| has led to the notions of (Rademacher) type and cotype, which are natural re- 
laxations of the generalized parallelogram identity. A Banach space X is said to have type p > if 
there exists a constant T > such that for every n and every x\, . . . , x n G X, 



E d(/(Z t ), f(Z )) 2 <C 2 tE d(/(Zi), f(Z )) 2 + C 2 5 2 (log t) 2 . 



M 2 (G) < C(l + <5). 




£i6{-l,l} i=l 



i=l 
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The least such constant T is called the type p constant of A, and is denoted T p (X). Similarly, X is 
said to have cotype q if there exists a constant C > such that for every n and every x±, . . . , x n £ X, 



The least such constant C is called the cotype q constant of X, and is denoted C q {X). 

These seemingly simple parameters turn out to encode a long list of geometric and analytic 
properties of a normed space X, and in the past three decades the theory of type and cotype has 
developed into a deep and rich theory. We refer to the survey article , the books jBJ ESI EHU 
1661 114j and the references therein for numerous results and techniques in this direction. 

A fundamental result of Maurey HHj states that any bounded linear operator from a linear 
subspace of a Banach space with type 2 into a Banach space of cotype 2 extends to a bounded 
linear operator defined on the entire space. More precisely: 

Theorem 2.1 (Maurey 's extension theorem). Let X, Y be Banach spaces, Z a linear subspace 
of X and T : Z — > Y a bounded linear operator. Then there exists a linear operator T : X — > Y 
such that f\ z = T and \\f\\ < T 2 (X)C 2 (Y) \\T\\. 

The extension problem for linear operators between Banach spaces is a linear variant of the 
classical Lipschitz extension problem which asks for conditions on a pair of metric spaces X, Y 
implying that every Lipschitz function defined on a subset of X taking values in Y can be extended 
to a Lipschitz function defined on all of X, with only a bounded multiplicative loss in the Lipschitz 
constant. Formally, let X, Y be metric spaces and denote by e(X, Y) the least constant K such 
that for every ZCI every Lipschitz function / : Z — > Y can be extended to a function / : X — > Y 
satisfying ||/||Lip < if II f II Lip (if no such K exists we set e(X,Y) — oo). Estimating e(X,Y) under 
various geometric conditions on X and Y is a classical problem which dates back to the 1930's. 
We refer to |651 E] |^ and the references therein for an account of known results on Lipschitz 
extension. 

The modern theory of the Lipschitz extension problem between normed spaces starts with the 
work of Marcus-Pisier [IT] and Johnson-Lindenstrauss j^H]- In 122] it is asked if there is a non- 
linear analog of Maurey's extension theorem. To investigate this question it is clearly necessary 
to develop non-linear variants of type and cotype. While there has been substantial progress in 
the past 20 years on non-linear notions of type, a satisfactory notion of non-linear cotype remains 
elusive. Enflo |17U18llT9*| studied the notion of roundness of metric spaces and subsequently in |2U| . 
generalized roundness to a notion which is known today as Enflo type. Let X be a metric space 
and fix n £ N. An n-dimensional cube in A is a mapping e i— ► x £ from {—1, 1}" to A. A is said to 
have Enflo type p with constant K if for every n and every n dimensional cube x : {—1, 1}™ — ► A, 



where e ~ e' if ||e — e'||i = 2. 

In the case of normed spaces A, Enflo type p clearly implies Rademacher type p — this follows 
by considering cubes of the form x £ = Y17=i e * Vi> w h ere D\i ■ ■ ■ ■, Vn £ A. A variant of Enflo type 
was introduced and studied by Bourgain, Milman and Wolfson in [2] (see also |59|). In jSE], it was 
shown that for a wide class of normed spaces, Rademacher type p implies Enflo type p. Despite 




£*£{-!, 1} »=1 



i=l 



d(x £ ,X- £ )P<KPj2 d ( X z> X e') 
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the usefulness of these notions of non-linear type to various embedding problems, they have not 
yielded extension theorems for Lipschitz functions. 

A breakthrough on the Lipschitz extension problem was obtained in the paper of Ball [2] , where 
he introduced the notion of Markov type p (which, as shown in [SE], implies Enflo type p). 

To state the main result of [2] we need to recall the notions of uniform convexity and smoothness 
of normed spaces (see |44j for a detailed account of this theory). Let (X, || • ||) be a normed space. 
The modulus of uniform convexity of X is defined for e G [0, 2] as 

5 x (e) = M<1 : x,y £ X, \\x\\ = \\y\\ = 1, ||x-y||=eS. (3) 

The normed space X is said to be uniformly convex if 5x (e) > for all e G (0, 2]. Furthermore, 
X is said to have modulus of convexity of power type q if there exists a constant c such that 
5(e) > ce ? for all e G [0,2]. It is straightforward to check that in this case q > 2. By Proposition 
7 in [3] (see also |21j). X has modulus of convexity of power type q if and only if there exists a 
constant K > such that for every x, y G X 

2 IMI 9 + ™ IMI 9 ^ ll x + y\\ q + ll x - y\\ q - ( 4 ) 

The least K for which holds is called the q-convexity constant of X, and is denoted K q {X). 
The modulus of uniform smoothness of X is defined for r > as 

px{T) = sup <^ 1 : x, y G X, ||x|| = ||y|| = IS . (5) 

X is said to be uniformly smooth if lim T „>o Px ^ = 0. Furthermore, X is said to have modulus of 
smoothness of power type p if there exists a constant K such that px{ T ) < -fCr p for all r > 0. It is 
straightforward to check that in this case necessarily p < 2. By Proposition 7 in Pj, px{t) < -fTr p 
for every r > if and only if there exists a constant S > such that for every x,y & X 

\\x + + Ik - y|| p < 2 ||x|| p + 2 S p \\y\\ p . (6) 

The least S for which (jHJ) holds is called the p-smoothness constant of X, and is denoted S P (X). 

It was shown in [3] (see also (U) that K^iLp) < l/\/P — 1 for 1 < p < 2, and S2(L p ) < yjp — 1 
for 2 < p < oo (the order of magnitude of these constants was first calculated in ) • 

In |22M21j (see also 01], Theorem I.e. 16.) it is shown that if a Banach space X has modulus of 
convexity of power type q then X also has cotype q. Similarly, if X has modulus of smoothness of 
power type p then X has type p. Observe that L\ has cotype 2 (see 01102]); but it is clearly not 
uniformly convex. There also exist spaces of type 2 which are not uniformly smooth |28[ I61j , but 
these spaces are much harder to construct. For all the classical reflexive spaces, the power type of 
smoothness and convexity coincide with their type and cotype, respectively. 

Thus, in the context of uniformly convex and uniformly smooth spaces, one can ask the following 
variant of the Johnson-Lindenstrauss question: is it true that e(X, Y) < oo whenever X is a Banach 
space with modulus of smoothness of power type 2 and Y is a Banach space with modulus of 
convexity of power type 2? This is precisely the problem studied by Ball in [2], who proved the 
following theorem: 

Theorem 2.2 (Ball's extension theorem). Let X be a metric space, and Y a Banach space 
with modulus of convexity of power type 2. Then e(X,Y) < 6M%(X) K2(Y). 
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In particular, Ball showed that M 2 [L2) = 1> whence e(L2,L p ) < 6/y/p — 1 for 1 < p < 2. 
Subsequently, in a difficult paper that is not based on the Markov type approach, Tsar'kov \6'A\ 
showed that every Banach space X with modulus of smoothness of power type 2 satisfies e(X, L 2 ) < 
oo. 

Here we prove the following result: 

Theorem 2.3. Fix 1 < q < 2 and let X be a normed space with modulus of smoothness of power- 
type q. Then 

In particular, for every 2 < p < oo, 

M 2 (L P ) < 4 V / P 3 ^- 

In conjunction with Ball's extension theorem, we obtain a non-linear analog of Maurey's exten- 
sion theorem: 

Theorem 2.4. For every two Banach spaces X,Y, 

e{X,Y) < 24S 2 (X)K 2 (Y). 
In particular, for 2 < p < oo and 1 < q < 2, 

e(L p ,L q ) < 24 

3 Some additional geometric applications 

In [nnj the extension problem for Holder functions was studied. Let X, Y be metric spaces. Recall 
that a function / : X — > Y is a Holder with constant K if for every x,y G X, d(f(x),f(y)) < 
Kd(x,y) a . Following [H5j We denote by A(X,Y) the set of all a > such that for all D C X 
any a Holder function / : D — ► Y can be extended to an a Holder function defined on all of X 
with the same constant. Analogously, B(X, Y) denotes the set of all a > for which there exists 
a constant C > such that for all D C X any function / : D — > Y which is a Holder with 
constant K can be extended to a function defined on all of X which is a Holder with constant CK. 
In [SSI the- following theorem was proved, which shows that the isometric and isomorphic extension 
problems for Holder functions between L p spaces exhibit a phase transition: for 1 < p, q < 2 we 
have A(L p , L q ) = (0,p/q*] while B(L p , L q ) = (0,p/2], where q* = q/{q— 1) (note that for 1 < q < 2 
we have p/q* < p/2). Additionally, for any p/2 < a < 1 there is an a Holder function from a subset 
of L p to L q which cannot be extended to an a Holder function defined on all of L p . 

The sets A(L P , L q ) were calculated in |65| for all values of p, q. It is shown there that: 

' (0,p/q*], ifl<P,?<2, 

(0,p/q] , ifl<p<2<<7<oo, 

(0,p*/q], \i2<p,q<oo, 

(0,p*/q*], if 2 < p < oo and 1< q < 2. 




A(L p ,L q ) = < 
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As noted in [22], the solution of Ball's Markov type 2 problem (i.e., Theorem 12 .3|) completes the 
computation of the sets B(L P , L q ): 

(0,p/2], ifl<p,g<2, 
(0,p/q\ , ifl<_p<2<g<oo, 
(0,2/g], if2<;p,g<oo, 
(0, 1] , if 2 < p < oo and 1 < q < 2. 



B(L p , L q 



We end with some applications of our results to the theory of bi-Lipschitz embeddings of finite 
metric spaces into normed spaces. Given two metric spaces X, Y and a one-to-one mapping / : 
X ^ Y, its distortion is defined as dist(/) = ||/||Lip • ||/ _1 ||Lip- In fTOl H] it was shown that if G is 
a finite graph with girth g (i.e., the length of the shortest closed cycle in G is g) and average degree 
5 > 2, then every embedding of G (with the graph metric) into Hilbert space incurs distortion at 
least ^jf\fg~- In fact, the same proof shows that any embedding of G into a metric space X of 
Markov type p incurs distortion at least 2M~nc) 9 ^ e thus obtain new classes of spaces into 

which large girth graphs cannot embed with low distortion. In particular, any embedding of G into 
L p for p > 2, incurs at least the distortion ^Fr-y/g/p. 



In [1] the metric Ramsey problem for the Hamming cube = {0, l} d (with the L\ metric) 
was studied, and it was shown that if A C embeds in Hilbert space with distortion D, then 
\A\ < l^l 1 -^ 2 . (Here, c* is a universal constant. This estimate is shown in 0] to be optimal up 
to logarithmic terms.) The proof of this fact is heavily based on the analysis of Markov chains on 
subsets of the cube, and on the fact that Hilbert space has Markov type 2. The same result holds 
true for embeddings into any metric space X of Markov type 2, with the constant c replaced by 
c*/M 2 (X) 2 . 

In it was was shown that any embedding of the Hamming cube {0, l} d into L p , p > 2, 
incurs distortion at least c p ^/d, where c p depends only on p. Using Theorem 12.31 it is possible to 
obtain the optimal dependence of c p on p, namely any embedding of {0, l} d into L p , p > 2, incurs 
distortion at least ^ max{l, sj d/p}. This follows directly from Theorem 12.31 bv considering the 
Markov chain corresponding to the standard random walk on the Hamming cube. The fact that 
this lower bound is optimal follows from the following embedding. Let ei, . . . ed be i.i.d. symmetric 

±1 valued Bernoulli random variables and map x £ {0, l} d to the random variable Y x := Yli=i x i £ i- 

(, p\ i/p 
E Yli=i £ i ) = ® (min{/c, \/pk}) , 

implying that for p < d and x, y G {0, l} d , cy 1 pjd- \x — y\\\ < (E\Z X — Z y \ p ) 1 ' p < \\x — y\\\, where c 
is a universal constant. Thus the embedding x *— > Z x incurs distortion Q(y/d/p). An application of 
having good bounds in terms of p on the L p distortion of the Hamming cube is the following. In [12] 
it was shown that if {0, l} d embeds into with distortion D then k > 2^ d ^ D2 \ This fact easily 
follows from the above discussion via an argument similar to that of [10]: By Holder's inequality 
l^a and £p are 0(1) equivalent when p = log A;. Thus, the fact that {0, l} d embeds into with 
distortion D implies that D > VL(\/d/p) = dj log k), which simplifies to give the claimed lower 
bound on k. 

4 Markov chains in smooth normed spaces 

We begin by recalling why the real line has Markov type 2. Let {Zt}fjL Q be a stationary reversible 
Markov chain on {1, . . . ,n}, with transition matrix A = (ay), and stationary distribution ir. The 
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Markov type 2 inequality (with constant 1) for R simply says that for every x±,...,x n E R, we 



have Yli j (A^ij (xt - xj) 2 < t ^ ,■ ^ a^- 



In fact, the stronger inequality 



TTj (^.*)tj (»i - Sj) 2 < A(t) dij TTi (Xi - Xjf 



(7) 



holds, where A(t) := = 1 + A + • • • + A* -1 and A is the second largest eigenvalue of A (Recall 
that — 1 < A < 1.) Indeed, since 7Tj = 7r,- (A t )ji, expanding the square in ((7J) shows that 

inequality may be rewritten 



((I-A t )x,x) n <A(t)((I-A)x,x) n 



(8) 



where x is the column vector (x\, . . . ,x n ), and (•, -} n refers to the inner product in L 2 (ir). If x is 
an eigenvector, then is clear; the general case follows by a spectral decomposition (note that A 
is self-adjoint on L 2 (tt)). 

The proof of the following lemma is a slight modification of the proof of Lemma 3.1 in Let 
L q (X) denote the collection of Borel measurable f:[Q,l]-+X with E||/||« = ft \\f\\ q < oo. It is a 

Banach space with norm (EH/H 9 ) 1 ^ 9 . (Of course, the choice of the interval [0, 1] as the domain for 
/ is rather arbitrary. It may be replaced with any probability space equivalent to [0, 1].) 

Lemma 4.1. Fix 1 < q < 2 and let Z E L q (X). Then 

E\\z\\ q < \\EZ\\ q + Sq [ x " >9 . m\z - EZ\\ q . 

II II - II II 2 g_l _ 1 II II 

Proof. Let 9 > be the largest constant such that for every Z E L q (X), 

9 (E\\Z\\ q - \\EZ\\ q ) < E\\Z - EZ\\ q . 

Our goal is to show that 9 > (2 q ~ 1 - l)S q (X)- q . To this end, fix e > and Z E such that 

(0 + e) (E||Z|| 9 - \\EZ\\ q ) >E\\Z - EZ\\ q . 

By the definition © of S q {X) applied to the vectors x = (Z + EZ)/2 and y = (Z — EZ)/2, we 
have the pointwise inequality: 



||Z|| 9 + ||EZ|| 9 < 2 

Taking expectations, we find that 
1 



-Z + -EZ 
2 2 



+ 2S 9 (X) £ 



-Z - -EZ 
2 2 



-E||Z-EZ|| <? < E\\Z\\ q - \\EZ\\ q 



< 2 ( 
2 



< 



■ E 



-Z + -EZ 
2 2 

-Z — -EZ 
2 2 



E ( -Z + -EZ 
2 2 



+ 2 5g(X) 9 E 



-Z - -EZ 
2 2 



+ 2 5 9 (X)' ? E 



-Z - -EZ 
2 2 



It follows that < | + 2S' (? (X)' J . Letting e tend to zero, and simplifying, yields the required 
result. □ 



S 



The following theorem, first proved in [57] (without the explicit constant), is a simple corollary 
of Lemma 14. II (see also Proposition 3.3 in |2j). 



Theorem 4.2 (Pisier). Fix 1 < q < 2 and let {Mfc}'£ =0 C L q {X) be a martingale in X. Th 



en 

E||M n - M Q \\* < ^r^j ■ E E H M *+i - M ^ 9 - 

k=0 

Proof. Assume that {Mfc}^ =0 is a martingale with respect to the filtration J-q C J^i C • • • C ,F n — i; 
that is, E (Mj+il.Fj) = Mi for i = 0, 1, ...,n — 1. By Lemma |4.1l with conditioned expectation 
replacing expectation, 

E(||M„ - Mo|| 9 |^ n -i) < ||Af B _i - M f + ^prri ■ E (H M « " Mn-^lFn-i)- 

Hence 

E||M„ - Moll 9 < E ||M n _! - M f + ^ q }P\ ■ E||M n - M„_i||«, 

and the required inequality follows by induction. □ 

The following lemma is motivated by the continuous martingale decompositions of Stochastic 
integrals constructed in 



Lemma 4.3. Let X be a normed space, {Zt}^ a stationary reversible Markov chain on {1, . . . , n} 
and f : {1, . . . , n} — > X. Then for every t G N there are two X-valued martingales {M s }* =0 and 
{N s } s=0 (with respect to two different filtrations) with the following properties: 

1. For every 1 < s < t — 1 we have that 

f(Z s+ i) - f{Z s _ x ) = (M s+ i - M a ) - (N t _ s+1 - Nts). (9) 

2. For every < s < t — 1 and q > 1, 

max{E||M s+1 - M s \\" ,E\\N S+1 - N s \\ g } < 2m\\f(Z 1 ) - f(Z )\\ q . (10) 

Proof. Let A = (a^) be the transition matrix of Zt, and let 7Tj := Pr(Zo = i). Define 

n n 

Lf(i) = Y,a lJ [f(j) - /(»)] =Y,aisM ~ /(»)■ 

3=1 3=1 

Then 

E(/(Z S )|Z 0) • • • , = E(/(Z,)|Z a _i) = L/(Z s _x) + Z,_i. 
Since {Z s }^ is reversible, we also have that for every < s < t, 

E(f(Z s )\Z s+1 , ...,Z t ) = Lf(Z s+1 ) + f(Z s+1 ). 

It follows that if we define Mo = f(Zo) and for s > 1: 

s-l 

M s := f{Z s )~Y J Lf{Z r ). 

r=0 
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then 



s-1 



E(M S \Z , Z s _x) = L/(Z a _i) + /(Z s _x) - ^ L/(Z r ) = M s _ x , 



r=0 



i.e., {M S } ( £L is a martingale with respect to the natural filtration induced by {Z s }^ . 
Now, define Nq := f(Zt) an d for 1 < s < t 



N a :=f(Z t -s)- ]T Lf(Z r 



Then for s > 1, 



E(A^|Z 



t-S + 1) 



r=t-s+l 



Zt) = iV, 



S-l) 



i.e., {A^ s }* =0 is a martingale with respect to the natural filtration induced by Zt,Zt—i, 
probabilistic terminology, {iVt_ s }* =0 is a reverse martingale). 
The identities 



Z (in 



M s+1 - M s = f(Z s+1 ) - f(Z s ) - Lf(Z s ), N s+1 -N, = f(Zt^x) - /(Z t _ s ) - Lf(Z t _ s ). 

imply ©. To prove (fTU)) observe that for every s > and q > 1, 



n n 



E||L/(Z s )f = £>|p % [/(J) - /(f)] < J] J>ay||/(3) - /Wll 9 = E||/(Zi) - f{Z Q )f. 

i=l j=l i=l j=l 

Therefore, 



E||M S+1 - M s \\i = E||/(Z S+1 ) - f(Z s ) - Lf(Z 8 )\\ q 

< 2«- 1 E||/(Z s+1 ) - f(Z s W + 2«- 1 E||L/(Z s )r < 2«E||/(Z 1 ) - /(Z )f, 

and similarly, E||iV s+1 - JVJ* < 2«E||/(Zi) - /(Zq)!! 9 . □ 

Proof of Theorem \2.fA Let {Z s }* =0 , /, {M s }* =0 and {-/V s }* =0 be as in Lemma FOl Assume first 
that t is even, and write t = 2m. Summing the identity © over s = 1, 3, 5, ... , 2m — 1 we get 

t/2 t/2 

f(Z t ) - /(Zo) = J2(M 2k - M 2fc _x) - ]T(iV 2fe - N 2k -i). 

k=l k=l 

By Theorem 14.21 (applied to the martingales ^2 k= \(M 2k — M 2k -\) and ^2 k =\(^2k — ^2fc-i))> we 
conclude that 

t/2 t/2 

E\\f(Z t ) - f(Z )\\« < 2^ X E ||£(M 2fc - Af 2 fc_i)| 9 + 2^E \\j2(N 2k - N 2k ^] 

k=l k=l 
t/2 



nq— 1 C ( Y")Q 

fi-J- £(E \\M 2k - M 2k ^f + E \\N 2k - N 2k ^r) 

k=l 



2i- 



< 



2 ^^ t -2^E\\f{Z l )-f(Z, 
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When t is odd, apply the above reasoning at time t — 1, to get 



E\\f(Z t )-f(Z )\\ q < 2 q - 1 E\\f(Z t ^)-f(Z )\\ q + 2 q ~ 1 E\\f(Z t )-f(Z^ 1 ) 

< ( 2 ^ q - S / X 1 )q (*-*) + 29 ^) E ~ f( Z °)W q 

o3<j— 2 c ( 

< tv\\f(Zi)-f(ZoW. 



□ 



There are various natural variants of the notion of Markov type. The following theorem deals 
with other moment inequalities for Markov chains in uniformly smooth normed spaces: 

Theorem 4.4. For every p £ (l,oo) and q G (1,2] there is a constant C(p,q) £ (0, oo) with the 
following properties. Let X be a normed space with modulus of smoothness of power type q. Then 
for every reversible Markov chain on {1, . . . ,n}, {Zt}^ , and every f : {1, . . . ,n} — ► X, if p < q, 
then 

e \\f(z t ) - f(z w < c( P , q ) s q {xym \\f{z x ) - f(z )\\ p , 

i.e., X has Markov type p. If p > q, then 

E\\f(Z t ) - f(Zo)\\ p < C(p,q)S q (Xrt^E\\f(Z l ) - f(Z W ■ 

Proof. Observe that by the definition of S q (X), for every r > 0, px(i~) < S q (X) q r q . Since Px(t) < 
r, assuming p < q and px(i~) < Kr q , where K > 1, we have Px{t) < Kt p . By (the proof of) 
Proposition 7 in [3] , it follows that for 1 < p < q, S P (X) < C S q (X) q / p , where C depends only on 
p, q. The first result now follows from Theorem 12.31 

Assume now that p > q. To prove the second assertion, observe that by a theorem of Figiel |21j 
(combined with Proposition 7]), S q (L p (X)) < CS q (X), where C depends only on p,q. Recall 
that a sequence of elements x\, X2, ■ ■ ■ in a Banach space Y is called a monotone basic sequence if 
for every a\, a2, ■ ■ ■ 6l and every integer n, 



i=l i=l 



n+1 



By a result of Lindenstrauss [13] ( see also |571 Proposition 2.2]), for a Banach space Y, if Py( t ) < 
Kt q for all t, then for every monotone basic sequence {£j}«>o in Y, 

n n 
i=l i=l 

Let {Mj}j>o C L p {X) be an X-valued martingale. By convexity, it follows that {(Mj — Mj_i)}j>i 
is a monotone basic sequence in L p (X). Moreover, py{ T ) < S q (Y) q r q for every normed space Y. 
Hence, for every n, 



\ M n-Mo\\ 9 Lp{X) < l±CS q (X)]«J2\\ M i- M i-l\\ q L p 



(xy 
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In other words, 

/ n \p/i 
E\\M n - M \\ p < [ACS q {X)f I J^(E||Mi - M^if ) q/p \ = [ACS q {X)f n p / q E \\M X - M \\ p . 

We now conclude the proof exactly as in the proof of Theorem 12.31 □ 

We now state the case X = IR, q = 2 of the second part of Theorem 14.41 with explicit constants. 

Theorem 4.5. Let p £ (2,oo). for every reversible stationary finite Markov chain {Zt}^ on 
{1, . . . , n} and every f : {1, . . . , n} — > R, we have for all t > 1: 



(E\f(Z t ) - f{Z )\ P Y' p < 16 mf{Zi) ~ f(Z 

The proof follows from the above argument, using S^Lp) < y/p — 1 for p > 2 (0, see also [2*T]). 
By considering the standard random walk on {1,2, ... ,n} it follows that the dependence on p in 
Theorem 14.51 is optimal, up to a universal multiplicative factor. 

5 Markov chains in trees and hyperbolic metric spaces 

In this section we prove Theorem 11.51 and Theorem 11.71 We do not attempt to optimize the 
constants. In particular, in the case of trees a more careful analysis shows that one may take C = 8 
in Theorem ll.41 Since we do not believe that this is the optimal constant, we use rougher estimates. 
In Section |S] we show that the infinite 3-regular tree satisfies M2(T) > \/3. 

Lemma 5.1. Let {Z t }^ be a reversible Markov chain on {1, ... , n} and f : {1, . . . , n} — > R. Then 
for every time t > 0, 

E max [f(Z s ) - f(Z )} 2 < 100tE[/(Zi) - f(Z )} 2 . 

0<s<t 

Naturally, the proof relies on Doob's L 2 maximal inequality for submartingales 

E max M 2 < 4E|M 4 | 2 . (11) 

0<s<t 

See, e.g., Q21 §4.4]. 

Proof. Let {M s }* =0 and {iY s }* =0 be as in Lemma H31 Observe that 



max[/(Z s )-/(Z )] 2 < 2 max \f(Z s ) - f(Z )] 2 + 2 max \f(Z s ) - f(Z s ^)] 

0<s<t 0<s<t 

s even s odd 

< 2 max[/(Z s )-/(Z )] 2 + 2 ^ [f(Z s ) - f(Z s ^)f 

0<s<t *■ — ' 
s even 

s odd 



2 



Therefore, 



E max :[f(Z s ) - f(Z )} 2 < 2E max \f(Z s ) - f(Z )} 2 + (t + 1) E[/(Zi) - f(Z )} 2 . (12) 

0<s<t 0<s<t 
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For even s we have as in the proof of Theorem 12.31 that 



s/2 s/2 

f(Z s ) - f(Z ) = ^(M 2fc ~ M 2k ^) - ^(N 2k ~ N 2k . 

k=l k=l 



Thus, Doob's inequality gives 



E max IfiZs) — f(Zn)r < 2E max 
o<s<t UK SJ Jy u;j - 0<i<t/2 

s even 



,fc=l 



+ 2E max 

0<i<t/2 



k=l 



8E 



L*/2J 

{M 2k - M 2k _ 1 ) 

k=l 



+ 8E 



[t/2| 
fc=l 



L*/2J 

= 8 E ( M 2fc - M 2fe „!) 2 + 8 ]T E(iV 2fc - N 2k ^f 

k=l k=l 

< 32tE[/(Z 1 )-/(Z )] 2 , 



(13) 



where in (j!3[) we have used the fact that the martingale differences are orthogonal and for the next 
inequality we used (|1U|). 

Together with (|12j). this concludes the proof. □ 

In what follows we use the notation of the Gromov product (fT|). 
Lemma 5.2. Zei X be a 5-hyperbolic metric space. Then for every m > 1 and r, xo, • • • , x m £ X 

(i(xo,x m ) 2 < 4 max [<i(xo, r) — d(xj, r)] 2 + 4 max [d(x m , r) — d(xj, r)] 2 + 

0<j<m 0<j'<m 



m—1 



m 



+ 4 ^ d(xj-,a: i+ i) a + 16 5 2 [log 2 

j=0 

Proof. Suppose first that m is a power of 2, m = 2 k . Then © gives 



(Xo\x m )r > min{(xokm/2)r, (%/2^ra)r} ~& 



Hence, induction gives, 



(xo|x m ) r > min{ (xj|xj + i) r : i = 0, 1, . . . , m — 1} — k 5 . (14) 

This also holds when m is not a power of two, provided we take k := [log 2 m] (for instance, define 
Xi := x m for m < i < 2 k ). Let j be the index i 6 {0, 1, ... ,m — 1} giving the minimum in (|14[). 
Then 

d(x , x TO ) = d(xo, r) + d(x m , r) - 2 (xo, %) 

< 2/c<5 + d(x ,r) + d(x m ,r) - 2(xj\x j+ i) 

= 2k 5 + (d(x ,r) - d(xj,r)) + (d(x m ,r) - d(xj +1 ,r)) + d(xj,x J+ i) . 

This implies 

<i(x ,x m ) 2 < 4(2/c5) 2 + 4 (d(x ,r) - d(x-,-, r)) 2 + 4 («i(x m , r) - d(xj+i, r)) 2 + 4d(xj, x^+i) 2 . 
The lemma follows. □ 
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We can now prove Theorem II. 51 



Proof of Theorem \1.5\ Fix r £ X. By Lemma 15.21 for every t > 1, 
Ed(f(Z t )J(Z )) 2 < 4E„mflxJd(/(Z ),r)-d(/(Z i ) J r)] 2 + 



o<i<t-i 

+ 4E max [d(/(Z t ), r) - d(f(Z j+1 ), r)] 2 + 4tEd(/(Zi), /(Z )) 2 + 
0<j<t— 1 

+ 16 5 2 Tlog 2 £] 2 . 

By Lemma 15. II applied to the function g(i) := d(f(i),r) we get 
E max \d(f(Z ),r)-d(f(Z j ),r)} 2 < lOOtE [d(f(Z l ),r)-d(f(Z ),r)} 2 < 100tEd(/(Zi), /(Z )) 2 . 

0<j<£— 1 

Similarly, since the Markov chain is reversible, 

E max [d(/(Z t ), r) - d(f(Zj), r)] 2 < 100 i E d{f(Z{), f(Z )) 2 . 

0<3<t 

The proof of Theorem 11.51 is thus complete. □ 
5.1 A lower bound for the Markov type constant of trees 

The following example shows that the infinite 3-regular tree satisfies M2(T^) > \/3. Fix an 
integer h and let be the complete binary tree of depth h rooted at r. For z G denote 
\z\ = d(z,r). Consider the following transition kernel on T/j. If z has three neighbors, it goes to its 
neighbor closer to r with probability 1/2 and to each of the other two neighbors with probability 
1/4. Otherwise [z has a single neighbor or z = r), it goes to any of its neighbors with equal 
probability. This is the transition kernel for a stationary reversible Markov chain {Zj}^ with 
stationary distribution Pt(Zq = z) = 2~\ z \/(h + 1). Denote St = \Zq\ — \Z t \ and observe that for 
every positive integer n, conditioned on the event {n < \Zq\ < h — n}, the sequence {St}t< n has the 
same distribution as {St}t<n, the simple random walk on Z starting at 0. Denote M n := maxt< n St 
and M n := max« n St- By a theorem of Pitman |H2], {2M n — 5' n } n >o has the same distribution as 
{S n }n>o conditioned on S n > for all n > (which is defined as the limit as m — > oo of {S n } n >o 
conditioned to hit m before revisiting 0). Let S n denote this conditioned walk, which is a Markov 
chain with transition probabilities p(x,x ±1) = (x ± l)/(2x). Induction easily gives ES^ = 3n, 
and therefore, 

E((2M n -5 n ) 2 | n < \Z \ < h-n) = Sn. 

2n 



Since Pr(|Zo| G [n, h — n]) = 1 — ^tj, we get 

E(2M n -S n ) 2 >3(l-— -Jn. (15) 

Let f be the (unique) vertex in {Zo, . . . , Z n } closest to the root, so that \Zq\ — \v\ = M n . Note 
that v need not be on the geodesic connecting Zq to Z n . Nevertheless, it is unlikely to be far 
from this geodesic. We now make this precise. Conditioned on Zo,v,M n and S n , the vertex Z n 
is distributed uniformly among the vertices u such that \u\ = S n that are descendents of v (that 
is, the simple path in from u to the root contains v). Therefore conditioned on Zq,v,M u the 
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random variable d(Zo,Z n ) is distributed as follows: d(Zo,Z n ) = 2M n — S n — 2k with probability 
2~ fc— l £ or integers k satisfying < k < min{M n , M n — S n } and the conditioned probability is 2~ k 
for k = min{M n , M n — S n }. Thus, setting £ = min{M n , M n — S n } we have that 

E(d(Z ,Z n ) 2 \ Z ,v,M n ,S n ) = 

> 



Taking expectations and applying Jensen's inequality gives 

Ed(Z ,Z n ) 2 > E(2 M n - S t ) 2 - 8 (E(2 M n - S n ) 2 ) 1/2 . 

Now, (JT3J) yields 

Ed(Z ,Z n ) 2 > 3(l - j-^)n-8V3n. 

By considering, say, h = n 2 , we deduce that sup h M^Th)* 2 > 3. It follows that ik^I^) 2 > 3 for the 
infinite 3-regular tree I^, since it contains all the finite trees T^. 

6 Embeddings in products of R-trees and proof of Theorem 11.71 

In what follows, given two metric spaces (X,dx) and (Y,dy), the metric space X x Y is always 
assumed to be equipped with the metric d((x, y), (a/, y')) = dx(x, x') + dy{y-, y')- 

A metric dona space X is a path metric if for every x, y E X there is a path in X from x to y 
whose length is d(x,y). 

An M-tree is a path metric space (T, d) such that for every two distinct points x,y € T there is 
a unique simple path from x to y in T (see |131 154j ). (Some definitions appearing in the literature 
also require the metric to be complete.) Equivalently, an R-tree is a 0- hyperbolic metric space 
whose metric is a path metric. 

An r '-separated set A in a metric space (X, d) is a subset iCI such that d(x, x') > r for every 
x ^ x' in A. An r-net is a maximal r-separated set. If A C X is an r-net, then X C |J agj 4 B(a, r). 
Clearly, every metric space has an r-net for every r > 0. 

Lemma 6.1. Fix an integer n and let Z be an n- dimensional normed space. Let (X, d) be a metric 
space and D,e > 0. Assume that every ball of radius e in X embeds bi-Lipschitzly in Z with 
distortion at most D. Then there is an integer N = N(n, D), a constant A = A(n, e, D) < 00 and 
a mapping F : X — > Z N which is Lipschitz with constant A and for every x, y E X 

d(x,y)<^ d(x,y) < \\F(x) - F(y)\\. 

Proof. Write Xq := X. Having defined C X, let A^ be an e-net in Aj. Define Xi + \ = 
Xi \ \J xeA _ B(x,e/4:). Observe that if x € X{ then for all j G {1, + 1} there is a point 
a,j £ Aj such that d(x,aj) < e. Moreover, for j / j' , d(aj,dj>) > e/4. Our assumption is 
that for every x £ X there is a function ^ : B(x,e) — > Z such that ip x ( x ) = and for all 



£-1 



2~^ 1 (2 M n - S n - 2 k) 2 + 2^(2 M n - S n - 2> 



k=0 



4A- 



(2M n - S n ) 2 -(2M n -S n )'^ ? 



k=0 



(2M n - S n ) 2 - 8(2 M n -S n ). 
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y,z E B(x,e) we have d(y,z) < \\ip x (y) — ip x (z)\\ < Dd(y,z). Then H^a;^)!! — £ D an d the 
balls B(ip x (aj), e/8) C B(0,e(D + 1/8)) C Z are disjoint. Comparison of volumes therefore gives 
i + 1 < (8-D + l) n . We have shown that there exists an integer iV < (8D + l) n and disjoint subsets 
A\,..., An C X such that for each j, if a, 6 E A,- then <i(a, b) > e and X C UjLi UaeA B(a,e/A). 
For every j < N define for a E A/ and x E B(a,e/2), 



ip a (x) if d(x,a) < 3e/8, 

(4-8e" 1 (i(x,a)) Va(x) if 3e/8 < d(x, a) <e/2. 



We also set fj to be Oonl\ UaeA -^( x > e /2)- (Observe that this defines a function on X since the 
balls {-B(a, e/2)} ag ^. are disjoint.) It is straightforward to verify that fj is Lipschitz with constant 
AD. 

Now define F : X — > by i 7 = /i © /2 © • • • ® /at • Then F is Lipschitz with constant 
AN D. Moreover, fix x,y E X with d(x,y) < e/8. There exists 1 < j < iV and a E such that 
d(x, a) < e/A. Hence d(y, a) < d(x, y) + d(x, a) < 3e/8; so that x, y E B(a, 3 e /8) and 

\\F{x) - F(y)\\ > WfjW-fjWW = \\Mx)-My)\\ > d(x,y). 

□ 

In what follows, a subset A of a metric space X is called e- dense, if for every x E X there is an 
a E .A such that a) < e. 

Corollary 6.2. Lei c E (0,1]. Lei X, 1" be metric spaces, Z an n- dimensional normed space, 
A C X an e dense subset and ip : X — > Y a 1 Lipschitz mapping such that for every a,b E A, 
d(ip(a) , ip(b)) > cd(a,b). Assume that every ball of radius ^ in X embeds bi-Lipschitzly in Z with 
distortion at most D. Then there exists an integer N = N(n, D) and a K = K(n, e, c, D) such that 
X embeds bi-Lipschitzly with distortion K into Y x Z . 

Proof. Let F, N and A be as in Lemma fo. ll applied with e replaced by 64 e jc. Define g : X — > Y x Z N 
by g(x) = (if(x), F(x)). Then g has Lipschitz constant bounded by A + 1. Moreover, if x,y E X 
are such that d(x,y) < 8e/c then d(g(x), g(y)) > \\F(x) — F(y)\\ > d(x,y). If, on the other hand, 
d(x,y) > 8e/c, then take x',y' E A satisfying d(x,x') < e and d(y,y') < e. It follows that 

d{g(x),g(y)) > d((p(x'), tp(y')) - d(tp(x), ip(x')) - d(ip(y), ip(y')) 

> cd{x \y) - d{x,x') - d(y,y) 

> c [d(x, y) - d(x, x') - d(y, y')] - d(x, x') - d{y, y') 

> cd(x,y) - 2e(c + 1) > -d(x,y). 

□ 

The following lemma is known (it follows, for example, by specializing the results of |33| ) . 
However, since we could not locate a clean reference, we will include a short proof. 

Lemma 6.3. Let X be a metric space, and let <fi : A — > T be a Lipschitz map from a subset A C X 
to a complete M-tree T . Then (j) may be extended to a map <p : X — > T that agrees with <p on A and 
has the same Lipschitz constant as <p. 
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Proof. With no loss of generality, assume that the Lipschitz constant of is 1. We will use dx and 
d,T to denote the metrics in X and T, respectively. Consider first the case where X contains only 
one point that is not in A. Say, X = A U {xo}. For any point a £ A, let B a be the closed ball 
B[6{a), dx(xo, a)) in T. Then any point in f] a&A B a may be chosen as 4>(xq). Thus, we have to 
show that this intersection is nonempty. Consider a, a' £ A. The triangle inequality in X shows 
that the sum of the radii of the two balls B a , B a i is at least as large as the distance between the 
centers. Thus, B a n B a > contains a point on the unique simple path joining (p(a) and <p{a') in T. 

We claim that M-trees satisfy the following Helly-type theorem. If F is a nonempty finite 
collection of convex subsets of T and every two elements F,F' £ F intersect, F D F' 7^ 0, then the 
whole collection intersects, (~) F£ -pF 7= 0. (Here, F convex means that the metric of T restricted to 
F is a path metric.) Indeed, suppose first that F = {iq , F2, F3}. Let t\ £ F2 D -F3, £2 G -^1 H F3 and 
£3 € iq ni 7 ^. Since there are no cycles in T, it follows that the three paths, one joining t\ to £2, one 
joining t<i to £3 and one joining t\ to £3 intersect. This intersection point will be in F\ n -F2 H -F3. 
Now, suppose that F = {iq,F2, . . . ,F n }, where n > 3. By the previous case, we know that any 
two sets in the collection F' := {F2 D iq, F3 fl iq, . . . , F n D iq} intersect. Induction then implies 
that the whole collection F' intersects, which implies that F intersects. 

Since balls in T are clearly convex, it follows that every finite subcollection of {B a } a( z A intersects. 
To prove that f] a&A B a 7^ 0, we must invoke completeness. Suppose now that B(t,r) and B(t',r') 
are two closed balls in T which intersect and none of these contains the other. Observe that 
B(t,r) n B(t',r') is also a closed ball in T whose center is the unique point in T at distance 
(dr{t,t') + r — r')/2 from t and at distance (dr(t,t') + r' — r) /2 from i', and whose radius is 
(r+r' — /2 (here, a single point is considered as a ball of zero radius). Thus, the intersection 

I (A 1 ) := flasA' Ba-, where A' C A is finite, is a nonempty ball. Let be the infimum radius of 
any such ball, and let a±, 02, . . . be a sequence in ^4 such that the radius of the ball I({ai, . . . , a n }) 
converge to r^,. Let c n be the center of I({a±, . . . , a n }) and let r n be the radius. If r ro > 0, then 
the above formula for the radius of the intersection of two balls shows that c n £ B a for every 
a £ A and for every n such that r n < 2^. Thus, clearly, f] aeA B a / in this case. In the case 
^00 = 0, we have r n \ 0, and it follows that c n is a Cauchy sequence. Invoking completeness, we 
conclude that the limit Cqo := lim n c n exists. It follows that Cqo G B a for every a € A, because 
I({a, a±, . . . , a n }) 7^ for every n. This completes the proof in the case X = Au {xo}. The general 
case follows by transfinite induction. □ 

Theorem 6.4. Fix 5 > 0, and assume that X is a 5-hyperbolic metric space whose metric is a path 
metric. Assume that there exists D,e > and n £ N such that every ball of radius e in X embeds 
in W 1 with distortion at most D. Then there exists an integer N and a K > such that X embeds 
bi-Lipschitzly into a product of N M-trees with distortion at most K. 

The trees in the statement of the theorem are infinite degree simplicial trees whose edge lengths 
may be taken as 1. Though we will not need this, one can actually prove this with bounded degree 
trees [H] of edge length 1. 

Proof. Since balls of radius e embed in W 1 with distortion at most D, there is an upper bound 
(depending only on n and D) for the cardinality of any e/2-separated set in X whose diameter is 
less than e. In the terminology of 0, this means that X has bounded growth in some scale. By 
a theorem of Bonk and Schramm |7j , there exists an integer m such that X is quasi-isometric to a 
subset of the m dimensional hyperbolic space H m . This means that there are constants a, a' ,b > 
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and a mapping / : X — ► H m such that for all x, y G X, 

a dfo y) - 6 < d(/(x), /(y)) < a' d(x, y) + 6. 

(Actually, one may even take a = a'.) By a theorem of Buyalo and Schroeder there is an 
K-tree T such that HP 71 is quasi-isometric to a subset of T m . It follows that there is some R > 0, 
an i?-net A of X and a bi-Lipschitz embedding g : A — > T m . Since the completion of an R-tree 
is an R-tree, we may as well assume that T is complete. Lemma 16.31 implies then that there is a 
Lipschitz extension <p : X — > T m of g. By scaling the distances in T we may also assume that ip is 
1 Lipschitz and for every x,y G A, d(ip(x) , <p(y)) > cd(x,y) for some constant c > 0. 

Let A' be an e/8 net in X. Set \I/(r) := sup agj4 / n l?(a,r)|. We claim that \P(r) < oo for 
every r > 0. This holds true for r = e, because balls of radius e in X embed in M n with distortion at 
most D. Now let a± € A', and let x £ B(a\, r). Since X has a path metric, there is a point x' £ X 
satisfying d(x',ai) = d(x',x) = d(a\,x)/2 < r/2. Let a' be a point in A' satisfying d(a',x') < e/8. 
Then a' G B(ai, r/2 + e/8) and x G B(a', r/2 + e/8). Thus, 

#(ai,r)C (J S(o',r/2 + e/8). 

a'GA'nB(ai,r/2+e/8) 

Consequently, ^(r) < ^(r/2 + e/8) 2 , which now implies that ^(r) < oo for every r. 

We now show that for every r > there is an n' = n'(r) and a D' = D'(r), both finite, such 
that every ball in X of radius r embeds in R n with distortion at most D' . Indeed, let x € X. Set 
A' x = A' x (r) := A' n B(x, r). Clearly, \A' X \ < ^(r + e/8). For every a G A' let : S(a, e) M n be 
a bi-Lipschitz embedding with distortion at most Z) satisfying ip a {a) = 0, and let 

i/j a (%) if d(a,x) < e/2, 

- < 2 (1 - e~ l d(a, x)) ip a (x) if e/2 < d(a, x) < e, 
otherwise. 

Now the required bi-Lipschitz embedding from B(x, r) into W 1 ' is given by x i— ► (</'a( ;z; )) ag ^4/ (with 
a padding of zeros to make re' independent from cc). It is immediate that this maps satisfies the 
requirements. Now an application of Corollary 16.21 completes the proof of the theorem (since the 
real line R itself is an M-tree). □ 

The following corollary contains Theorem 11.71 

Corollary 6.5. Let X be an n dimensional complete simply connected Riemannian manifold with 
pinched negative sectional curvature (i.e., its sectional curvature takes values in the compact interval 
[-R, —r] C (— oo,0)). Then there is an integer N = N(n,r,R) and D = D(n,r,R) > such that 
X embeds bi-Lipschitzly into a product of N trees with distortion D. In particular, by Theorem \t.J\ 
X has Markov type 2 and M2(X) can be bounded by a function ofn,r,R. 

Proof. It is a standard fact [H] that X is 5 hyperbolic (with 5 proportional to 1/r). The fact that 
there exists e > such that all the balls of of radius e in X embed bi-Lipschitzly in W 1 follows from 
Rauch's comparison theorem (see J2j and Chapter 8+ in |2l]). Thus the required result follows 
from Theorem 16.41 □ 
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Figure 1: The Laakso graphs. 



7 The Laakso graphs, doubling spaces and weak Markov type 

Recall that a metric space X is said to be doubling with constant A if every ball in X can be 
covered by A balls of half the radius. Assouad's theorem [J states that if (X, d) is doubling with 
constant A then for every e > 0, (X, d 1_e ) embeds in Hilbert space with distortion C(X,s) < oo. 
Thus X has Markov type 2 — e with constant depending only on A and e. Similarly, it is shown 
in [32] that if G is a planar graph equipped with the graph metric da, then for every e £ (0, 1), 
(G,d^~ £ ) embeds in Hilbert space with distortion 0(1/ y/e) (and the dependence on e is optimal). 
Thus G has Markov type 2 — e with constant 0(l/y/e). Is it true that a metric space with doubling 
constant A has Markov type 2 with constant depending only on A? Similarly, is it true that planar 
graphs have Markov type 2 with a universally bounded constant (this would be a generalization of 
our theorem on trees)? More generally, is it true that a Riemannian surface of genus g has Markov 
type 2 with constant depending only on gl The above embedding results are all optimal, so a proof 
of the Markov type 2 property for such spaces cannot go through embeddings in Hilbert space. The 
standard example showing that in both embedding results we must pass to a power of the metric 
is the family of graphs known as the Laakso graphs |37l 138) , which are planar graphs whose graph 
metric is uniformly doubling, yet they do not uniformly embed into Hilbert space. These graphs 
Gk are defined recursively as in Figure ^ 

Proposition 7.1. The Laakso graphs are uniformly of Markov type 2; that is sup^. A^Gfe) < oo. 

This proposition may be viewed as some very limited indication that doubling spaces and planar 
graphs have Markov type 2. 

Proof. Fix k £ N. Let r be the leftmost point of Gk, and for an arbitrary vertex v in Gk set 
\v\ := d(v, r). We claim that if Vo, v\, . . . , v t is a path in Gk, then there is some j G {0, 1, ... , t} 
such that 

d(vo,v t ) < \\v \ - \vj\\ + \\vj\ - \v t \\. (16) 

Indeed, let r 1 be the rightmost endpoint of Gk- Note that for every vertex v in Gk we have 
d(v,r') = \r'\ — \v\. If vq and v n are on a geodesic path from r to r', then d(vo,vt) = \ \vq\ — \vt\\, 
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and hence we may take j = 0. We now assume that there is no geodesic from r to r' containing 
{vo, v t }. Let [5 be a geodesic path from v$ to r, and let 7 be a geodesic path from v t to r. Let w 
be the vertex in (3 D 7 at maximal distance from r. Similarly, we define it' in the same way with r' 
replacing r. It is easy to verify that {u, u'} separates from vt and separates {vo,v t } from {r,r'}. 
Consequently, there is some j G {0, 1, . . . , t} such that Vj G {u, u'} and that j satisfies (|16j) . 

Now suppose that Zq, Z±, . . . , Zt are steps of a reversible stationary Markov chain on {1, . . . , n} 
and g is a mapping from {0,1,..., n} to the vertices of G^. Set Wj := g(Zj). For each i = 
1,2, consider a path in from Wi_i to PVj whose length is d(Wj_i,Wj). Putting these 

paths together gives a path (uq, . . . ,v n ) passing through the points Wo, . . . ,Wt whose length 
n is Yti=id{Wi, Wi-i). Set j { := ^ l s=1 d(W s , W a -i). Then Vji = W t . Let j satisfy (O and let 
Iq G {1, . . . ,t} be such that ji _i < j < jj . Then we have 

d(W ,Wi) < ||Wb|-hl| + ||t;j|-|W t || 

< ||Wo| - |Wio_i|| + d(Wio_i, Wio) + IIWi.,1 - I Will 



Thus 



< 2 ||Wo| - |W i0 _i|| + d(W i0 -!, W l0 ) + \\W \ - \W t 



d(W ,W t ) 2 < 3{2\\W \-\W l0 . 1 \\) 2 + 3d(W i0 . 1 ,W i0 ) 2 + 3\\W \-\W t \\ 2 



t 

< 15 max{(|Wb|-|Wi|) 2 } + 3y;d(Wi_i,Wi) 5 



Consequently, the proof is completed by two applications of Lemma 15.11 one with f(Z) = \g(Z)\ 
and the other with f(Z) = —\g(Z)\. □ 

It turns out that in many situations a similar argument proves a weak version of Markov type 2. 

Definition 7.2. A metric space X has weak Markov type 2 if there is a finite constant C such that 
for every reversible stationary Markov chain Z on {1, . . . , n} and every map f : {1, . . . ,n} — > X, 
we have 

Vt G N,C > Pr (d(f(Z ),f(Z t )) 2 >t()< C 2 C 1 E(d(f(Z ),f(Z 1 )) 2 ) . 

The least C satisfying this inequality will be called the weak Markov type 2 constant of X and will 
be denoted Mf{X). 

Note that Chebyshev's inequality gives M 2 {X) > M%{X). 

Theorem 7.3. Let X be a metric space with doubling constant A < 00. Then X has weak Markov 
type 2, and M™(X) is bounded by a finite function of A. 

Proof. Let R > and let A be an R-net in X. Set E = {{a, a') G A x A : d(a,a') < 16 R}. Then 
G := (A,E) is a graph. Since every ball of radius 16 R in X can be covered by A 5 balls of radius 
R/2, it follows that the maximal degree in G is at most A 5 . Thus, the chromatic number of G is 
at most A 5 + 1. Consequently, there is a partition A = U J= i -4? °f A into (16 i?)-separated subsets 
with N < X 5 + 1. Define fj(x) := d(x, Aj), j = 1, 2, . . . , N. Then fj(x) - fj(y) < d(x, y) holds for 
x,y G X. 
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Now suppose that x,y £ X satisfy d(x,y) £ [3R, 4R]. Let a £ A he such that d(a,x) < R 
and let j £ {1, . . . ,N} be the index such that a £ Aj. Then fj(x) < d(a,x) < R. Note that 
d{a,y) < d(a,x) + d(x,y) < 5 R, and since Aj is (16 i?)-separated, it follows that fj(y) = d(a,y) > 
d(y, x) — d(a, x) >2R. 

Let (Zt) be a reversible stationary Markov chain on {1, 2, . . . , n} and let / : {1, 2, . . . , n} — > 
X. Set Wi := f(Zi). Fix t £ N. Let A be the event that d{W ,W t ) > AR, let Bj be the 
event that d(Wi_i,Wi) > i?, let C, be the event d(W ,Wi) £ [3R,4R], and let C{ be the event 
\fj(Wo)-f j (W i )\>2R. Clearly, 

Ae (u*)u(uft)- 

The previous paragraph shows that 

N 

Consequently, 

Pr(^) < ^Pr(^) + ^Pr max |/,(^o) - /i(W 4 )| > 2 i? J . 

The first of these sums is bounded by t E(d(Wo, VFi) 2 ) i? -2 , while Lemma I5~T1 shows that the second 
sum is bounded by 50 N t K(d(Wo, W\) 2 ) R~ 2 . The theorem follows by choosing R to satisfy 16 R 2 = 
t£. □ 



Let (X, dx), (Y, dy ) be metric spaces. We shall say that (X,dx) embeds weakly into (Y,dy) 
with distortion K if for every A > there is a 1 Lipschitz mapping g& : X — > Y such that if x, y £ X 
satisfy d x (x, y) > A then d Y {g&{x), g/±{y)) > A/K. Observe that in this case M%(X) < KM%>(Y). 
Indeed fix a reversible stationary Markov chain Z on {1, . . . ,n} and a map / : {1, ... ,n} — ► X. 
For every time t £ N and every ^ > 0, 

Pr(dx(f(Z )J(Z t )) 2 >t() < Pr(d Y (g^(f(Z )),g^(f(Z t ))) 2 >^\ 

< M%{Y) 2 K 2 C l ^(d Y ( gvK {f(Z,)), gvK {f{Z t ))) 2 ) 

< M^(Y) 2 K 2 C 1 ^{d x (f(Z ),f(Z t )) 2 ). 

It follows from the results of |42[ I35| that if X is doubling with constant A then X embeds weakly 
into Hilbert space with distortion 0(log A). This yields an alternative proof of Theorem 17.31 with 
the concrete estimate M™(X) = O(logA). Moreover, the results of [121 ESI (specifically, see the 
proof of Lemma 5.2 in [12]) imply that any planar graph embeds weakly into Hilbert space with 
O(l) distortion. More generally, in combination with Corollary 3.15 in [U, any Riemannian surface 
5 of genus g embeds weakly into Hilbert space with distortion 0(g + 1). Thus M|°(5) = 0(g + 1). 
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8 Discussion and open problems 



1. In jSH] (see also a related previous result in [H]) it is shown that for every p > 1, if a Banach 
space X has Rademacher type p then it also has Enflo type q for every q < p. No such result 
is known for Markov type. In j^E] it is shown that if X is a UMD Banach space (see |lf)j 
for details on UMD spaces) of Rademacher type p, then X also has Enflo type p. It would 
be desirable to obtain a result stating that for a certain class of Banach spaces, the notions 
of Rademacher type p and Markov type p coincide (or almost coincide). The most daring 
conjecture would be that for every Banach space, Rademacher type p implies Enflo type p, or 
even Markov type p. This amounts to proving that for Banach spaces of type greater than 1 
(also known as ii'-convex space. See [2E] for the geometric and analytic ramifications of this 
assumption), the Rademacher type and Enflo type (or Markov type) coincide. 

One simple example of a class of spaces for which we can prove that there is a strong connection 
between Rademacher type and Markov is Banach lattices. A Banach lattice is a Banach space 
(X, || • ||) which is partially ordered and satisfies the following axioms. For every x,y,z £ X, 
if x < y then x + z < y + z, and for every scalar a £ [0, oo), x > implies that ax > 0. It is 
also required that for all x, y € X there exists a least upper bound x V y and a greatest lower 
bound x A y. For x £ X denote |x| = x V (— x). The final requirement is that the partial 
ordering is compatible with the norm in the sense that if |x| < \y\ then ||x|| < ||y||. Examples 
of Banach lattices are the classical function and sequence spaces, with the point-wise partial 
order. We refer to [H] for an account of the beautiful theory of Banach lattices. 

A combination of a theorem of Figiel |21j and a theorem of Maurey |5()j (see Theorem l.f.l. 
and Proposition l.f.17. in [11]) implies that a Banach lattice X of type 2 can be renormed to 
have a modulus of smoothness of power type 2. Thus by Theorem 12 ..SI X has Markov type 2. 

2. Under what conditions on a metric space does Enflo type p imply Markov type pi 

3. Is it true that if a metric space has Markov type p then it also has Markov type q for every 
q < pi For normed spaces this is indeed the case, by a straightforward application of Kahane's 
inequality [3*2] . 

4. We conjecture that the factor of 24 in Theorem l2.4l is redundant. In particular it seems likely 
that for 2 < p < oo and 1 < q < 2, e(L p , L q ) < ^J{p — \)/{q — 1). If true, this would be a 
generalization of Kirszbraun's classical extension theorem |34| (see also |65l Ifi]). 

5. Since L\ has cotype 2 but isn't uniformly convex, there is no known non-linear analog of 
Maurey's extension theorem for Li-valued mappings. In particular, it isn't known whether 
e(L2,Li) is finite or infinite. 

6. What is the best Markov type 2 constant for trees? More precisely, define M2(tree) to be 
supM2(T) over all trees T. (It is clear that this sup is a max.) One can show using the 
methods of the present paper that M2(tree) < 8. The example in Subsection 15.11 shows that 
M 2 (tree) > y/3. 

7. As discussed in Section [JJ we believe that planar graphs and doubling spaces have Markov 
type 2. Also, it seems likely that CAT(O) spaces have Markov type 2 (see [8] for a discussion 
of CAT(O) spaces). 
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8. Say that a metric space X has maximal Markov type p, if there exists a constant K 
such that for every finite stationary reversible Markov chain {Z(}^ on {1, . . . , n} and every 
mapping / : {1, . . . , n} — > X, we have 

E max d(f(Z s ),f(Z Q )f < K p tMd(f(Zi), f(Z )) p . 

l<s<t 

for all t 6 N. In all the cases in which we proved that a metric space X has Markov type 2, the 
argument actually shows that it has maximal Markov type 2. This was explicit in the proofs 
for trees and hyperbolic spaces. To see this in the setting of Banach spaces with modulus of 
smoothness of power type 2, it suffices to note that Doob's Li maximal inequality (jlljl is also 
valid for a martingale {M s } s >q in a Banach space, since by Jensen's inequality, {||M s ||} s >o is 
a submartingale for the same filtration. We do not know whether in general, Markov type p 
implies maximal Markov type p. 

Acknowledgement. We are grateful to Russ Lyons for helpful discussions at an early stage of 
this work, and to Terry Lyons for sending us his paper with T. S. Zhang. 
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